An almost linear time algorithm for finding Hamilton cycles in 
sparse random graphs with minimum degree at least three. 



Alan Frieze* and Simi Haber 
Department of Mathematical Sciences, 
Carnegie Mellon University, 
Pittsburgh PA15217. 



October 24, 2012 



Abstract 

We describe an algorithm for finding Hamilton cycles in random graphs. Our model is the random 
graph G = G^^. In this model G is drawn uniformly from graphs with vertex set [n], m edges and 
minimum degree at least three. We focus on the case where m = cn for constant c. If c is sufficiently 
large then our algorithm runs in 0(n^^°'^-') time and succeeds w.h.p. 

1 Introduction 

The threshold for the existence of Hamilton cycles in random graphs has been known very precisely for 
some time, Komlos and Szemeredi [15], Bollobas [4], Ajtai, Komlos and Szemeredi [1]. Computationally, 
the Hamilton cycle problem is one of the original NP-complete problems described in the paper of Karp 
[13] . On the other hand Angluin and Valiant [2[ were the first to show that the Hamilton cycle problem 
could be solved efficiently on random graphs. The algorithm in [2 is randomised and very fast, 0(n log^ n) 
time, but requires Knlogn random edges for sufhciently large K > 0. Bollobas, Fenner and Frieze [7] gave 
a deterministic polynomial time algorithm that works w.h.p. at the exact threshold for Hamiltonicity, it is 
shown to run in 0(rt'^"'"°'^^^) time. 

The challenge therefore is to find efhcent algorithms for graphs with a linear number of edges. Here we have 
to make some extra assumptions because a random graph with cn edges is very unlikely to be Hamiltonian. 
It will have isolated vertices. It is natural therefore to consider models of random graphs with a linear 
number of edges and minimum degree 6 at least two. In fact minimum degree three is required to avoid the 
event of having three vertices of degree two having a common neighbor. For example, in the case of random 
r-regular graphs, r = 0(1) > 3, Robinoson and Wormald [T7], [IB] settled the existence question and Frieze, 
Jerrum, Molloy, Robinson and Wormald [llj gave a polynomial time algorithm for finding a Hamilton cycle. 
The running time of this algorithm was not given explicitly, but it is certainly 17 (n'^). 

We will work on a model where the assumption is that 5 > 3 as opposed to all vertices having degree exactly 
three. It is tempting to think that existence results for the regualr case r = 3 will help. Unfortunately, this 
is not true. The random graph G*-^ is uniformly sampled from the set Gni?i of graphs with vertex set [n], 
m edges and minimum degree at least three. 

Frieze, ,9^ gave an 0(n^'*"°*^^-') time algorithm for finding large cycles in sparse random graphs and this can 
be adapted to find Hamilton cycles in G^-^^ in this time for sufficiently large c. The paper 10: gives an 
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algorithm that reduces this to n^-^+°(^'> for sufficiently large c. The main aim of this paper is to construct 
an almost linear time algorithm for this model. 

Theorem 1.1. If c is sufficiently large then our algorithm finds a Hamilton cycle in Gf^J^^, m = cn, and 
runs in 0{n}^°^^^) time and succeeds w.h.p. 

Remark 1.1. The n°^^'' term here is (logn)'^'^'°s'°s"^ which is tantalisingly close to best possible (?) log*"'^^-' n. 

2 Outline of the paper 

The paper [5] gave an efficient algorithm for finding the maximum matching in a sparse random graph. Its 
approach was to (i) use the simple greedy algorithm of Karp and Sipser [14] and then (ii) augment it to a 
maximum matching using alternating paths. In this paper we replace the Karp-Sipser algorithm with the 
algorithm 2greedy that w.h.p. finds a 2-matching in G = G^=^ with O(logn) components and we replace 
alternating paths with extensions and rotations. (A 2-matching is a spanning subgraph of maximum degree 
at most two). 

In Section[3]we will describe our algorithm. We will describe it in two subsections. We will describe 2GREEDY 
for finding a good 2-matching M in detail in Section IXT] In section we will describe an algorithm extend- 
ROTATE that uses extensions and rotations to convert M into a Hamilton cycle. In Section|4]we discuss some 
"residual randomness" left over by 2greedy. In Section [5] we prove some structural properties of G^^j^. In 
Section [H] we prove some properties relating the output of 2greedy to the execution of extend-rotate. 
In Section [7] we do a final calculation to finsih the proof. In Section [5] we point to our difiiculties in proving 
nlog*^*-^-* n and in Section [S] we make some final remarks. 

3 Algorithm 

As already stated, there are two phases to the algorithm. First we find a good 2-matching M and then we 
convert it to a Hamilton cycle. We look first at how we find M. 

3.1 Algorithm 2GREEDY 

We greedily and randomly choose edges to add to M. Edges of M are deleted from the graph. We let 
b{v) G {0,1,2} denote the degree of v in M. Once b{v) = 2 its incident edges are no longer considered 
for selection. The vertex itself is deleted from the graph. Thus the graph from which we select edges will 
shrink as the algorithm progresses. We will use F to denote the current subgraph from which edges are to 
be selected. When there are vertices v of degree 2 — b{v) (or less) in F, we take care to choose an edge 
incident with such a vertex. Our observation being that there is a maximum cardinality 2-matching of F 
that contains such an edge. 

If every vertex -y of F had degree at least 3 — b{v) then we choose an edge randomly from edges that are 
incident with vertices v that have b{v) = 0. In this way, we quickly arrive at a stage where every vertex of F 
has b{v) = 1. At this point we use the algorithm of [8] to find a (near) perfect matching M*, which we add 
to M as our final 2-matching. 

We describe 2greedy in enough detail to make some of its claimed properties meaningful. 
We let 

• fj,he the number of edges in F, 

• Yk = {ve [n] : dr{v) = k and b{v) = 0}, A: = 0, 1, 2, 
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• Zh = {v G [n] : dr{v) = k and b{v) = 1}, k = 0, 1, 

• Y = {vG[n]: dr{v) > 3 and b{v) = 0}, 

• Z = {v G[n]: dr{v) > 2 and b{v) = 1}, 

• M is the set of edges in the current 2-matching. 

Note that V{T) = [n] \ {Yq U Zq) and that b{v) e {0, 1} for v e V{r). 

We will assume that the input to our algorithm is an ordered sequence am = (ei, 62, ... , em) where m = cn. 
Here = {ei, 62, ... , e^} are the edges of and am is a random ordering of Em- Once these orderings 
are given, the vertices and edges are processed in a deterministic fashion. Thus for example, if 2greedy 
requires a random edge with some property, then it is required to take the first available edge in the given 
ordering. 

We now give details of the steps of 

Algorithm 2GREEDY: 
Step 1(a) Fi 7^0 

Choose w e Yi . We choose v by finding the first edge in the ordering a that contains a member of Yi . 
Suppose that its neighbour in F is w. We delete the edge [v, w) from T add {v, w) to M and move v 
to Zo. 

(i) If w is currently in Y then move it to Z. If it is currently in li then move it to Zq. If it is 
currently in Y2 then move it to Zi. Call this re-assigning w. 

(ii) If b{w) = 1 then we move w to Zq and make the requisite changes due to the loss of other edges 
incident with w. Call this tidying up. 

Step 1(b): Fi = and ^2 7^ 

Choose V E ¥2- We choose v by finding the first edge in the ordering a that contains a member of 12- 
Suppose that its neighbours in F are wi,W2- 

We choose one of the neighbors at random, say wi. We move v to Z^. We delete the edge {v, wi) from 
r and place it into M. In addition, 

(i) If b{w\) = then put b{w{) = 1 and add the edge {v,wi) to M. Re-assign w\. 

(ii) If b{w-\) = 1 then we delete wi from F. Tidy up. 

Step 1(c): I2 = and Zi 7^ 

Choose V G Zi. We choose v by finding the first edge in the ordering a that contains a member of Zi. 

Let u be the other cndpoint of the path P of M that contains v. Let w be the unique neighbour of v 
in F. We delete v from F and add the edge {v, w) to M. In addition there are two cases. 

(1) If b{w) = then we re-assign w. 

(2) If b{w) = 1 then we delete vertex w and tidy up. 

Step 2: Yi = y2 = Zi = and y 7^ 

Choose the first edge {v, w) G E{T) in the order am incident with a vertex v £Y. We delete the edge 
{v, w) from F and add it to M. We move v from Y to Z. There are two cases. 

(i) If b{w) = then move w from Y to Z. 

(ii) If b{w) = 1 then we delete vertex w and tidy up. 
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step 3: Yi = Fa = Zi = y = 

At this point F wih be distributed as Gfrf^ for some fi where /z = O(i^). As such, it contains a (near) 
perfect matching M* 12; and it can be found in 0{i') expected time [S]- This step comprises 

Step 3a Apply the Karp-Sipser algorithm to F. W.h.p. this results in the construction of a matching 
that covers all but 0{l'^^^) vertices U = {ui, M2, . . . , Ui}. 

Step 3b Now find augmenting paths from M2i-i to U2i for i < £/2. This produces the matching M*. 
The output of 2GREEDY is set of edges M ^ M U M*. 

3.2 Extension-Rotation Algorithm 

We now describe an algorithm, extend-ROTATE that w.h.p. converts M into a Hamilton cycle. The main 
idea is that of a rotation. Given a path P = (ui, U2, ■ ■ ■ ,Uk) and an edge e = {uk, Ui) where i < fc — 2 we say 
that the path P' — (mi, . . . ,Ui, u^, u^^i, . . . , Uij^i) is obtained from P by a rotation, ui is the fixed endpoint 
of this rotation. We say that e is the inserted edge. 

Given a path P with endpoints a, h we define a restricted rotation search RRS{y) as follows: Suppose that 
we have a path P with endpoints a, h. We start by doing a sequence of rotations with a as the fixed endpoint. 
Furthermore 

Rl We do these rotations in "breadth first manner" , described in detail in Section [51 

R2 We stop this process when we have either (i) created v endpoints or (ii) we have found a path Q with 
an endpoint that has a neighbor w outside of Q. The path Q ~\- w will be longer than P. We say that 
we have found an extension. 

Let END{a) be the set of endpoints, other than a, produced by this procedure. Assuming that we did not 
find an extension and having constructed END{a), we take each x G END{a) in turn and starting with the 
path Px that we have found from a to x, we carry out R1,R2 above with x as the fixed endpoint and either 
find an extension or create a set of v paths with x as one endpoint and the other endpoints comprising a set 
END{x) of size i^. 

Algorithm extend-rotate 

Step ERl Let Ki, K2, . . . , Kr be the components of M where \Ki\ = max{|i^j| : j € [r]}. If Ki is a path 
then we let Pq — Ki, otherwise we let Pq = Ki \ {e} where e is any edge of Ki. 

Step ER2 Let P be the component of the current 2-matching M that contains Pq. If P is not a cycle, go 
directly to ER3. If P is a Hamilton cycle we are done. Otherwise there is a vertex u G P and 
a vertex v ^ P such that / = (w, v) is an edge of G, assuming that G is connected, see Lemma 
16.31 Let Q be the component containing v. By deleting an edge of P incident to u and (possibly) 
and edge of Q incident with v and adding / we create a new path of length at least |P| + 1 with 
vertex set equal to V{P) U V{Q). Rename this path P. 

Step ER3 Carry out RSS{iy) until either an extension is found or we have constructed v endpoint sets. 

Case a: We find an extension. Suppose that we construct a path Q with endpoints x, y such 
that y has a neighbour z ^ Q. 

(i) If z lies in a cycle C then let P be a path obtained from C by deleting one of the edges 
of C incident with z. Let now P — x,Q,y, z, R and go to Step ER2. 

(ii) If z = Uj lies on a path R = (wi, M2, . . . , Wfc) where the numbering is chosen so that 
j > k/2 then we let P — x,Q, y, z, . . . , ui and go to Step ER2. 
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Case b: If there is no extension then we search for an edge / = {p, q) such that p G END{a) and 
q G END{p). If there is no such edge then the algorithm fails. If there is such an edge we 
let Q be the corresponding path from p to q. We replace P in our 2-matching by the cycle 
Q + / and go to ER2. 



3.3 Execution Time of the algorithm 

The expected running time of 2greedy is 0{n) and w.h.p. it completes in 0{n) time with a 2-matching M 
with at most Ki logn components for some constant Ki > 0. This follows from the results of [8] and |10) . 

To bound the execution time of extend-rotate we first observe that it follows from [2 that RSS{i') can 
be carried out in 0(j/^ logn) time. We will take 



where 

a: (log log rt)^ 

] 1) 

logn 

where K is a sufficiently large positive constant and that c is sufficiently large. 

We now bound the number of executions of RSS{i'). Each time we execute Step ER3, we either reduce the 
number of components by one or we halve the size of one of the components not on the current path. So if 
the component sizes of Mq are ni, n2, . . . , then the number of executions of Step ER3 can be bounded by 

K + ^ log2 = 0(log^ n). 

i=l 

So the total execution time is w.h.p. of order 

n + (ni/2+o(-))2 iog2 n = O(ni+o(^)). 
This clearly suffices for Theorem ll.il 

We will now turn to discuss the probability that our algorithm succeeds after we have described 2greedy. 
We remind the reader that the analysis assumes that c is sufficently large. 



4 Residual Randomness 

Let G be a graph with an ordering of its edges and consider a run of 2GREEDYon that graph. At every 
point of time each vertex is in one of the sets Yq, Zo, Yi,Y2, Zi,Y and Z as defined above. 

We let the set of vertices that were removed from the graph while in Z be denoted by R. We call them 
"regular vertices" . These vertices are removed from F in the execution of a Step 1 or Step 2 of 2greedy 
and they are internal vertices of paths of M at the start of Step 3. 

For a vertex v let ty be the time at which 2greedy deletes v from F. Vertices w that are not deleted before 
the start of Step 3 are given i„ — oo. A vertex is early if < ti^^^ and late otherwise. An edge ei is punctual 
if i < (1 — a)m and tardy otherwise, where a is a small positive constant. 

When a vertex v £ R gets matching degree two we take the incident non-matching edge e with the lowest 
index in a to be its Z-witness. The fact that v E Z just before this happens implies that e exists. We let W 
denote the set of Z- witnesses. We next defne two sets Rq and Aq: 

We let 

Ro = (z R '■ V is early and the Z-witness of v is punctual} . 
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and 

Af) — {v : V has degree at least 4 in r„i-e } . 
We may now state and prove the main lemma of this section. 

Lemma 4.1. In what follows R.q,Aq are defined with respect to G and an ordreing of its edges. Let e — {x,y} 
be a tardy edge of G where x € and y € Aq. Let G' be the graph obtained from G by deleting e. Assume that 
running 2GREEDY on G up until Step 3 gives a 2-matching M and a witness set W and running 2greedy 
on G' up until Step 3 gives M' and W. Then M = M' and W = W . 

Proof We claim that up to time t^, 2greedy will delete the same vertices and edges from Tt and and 
then delete x from both. After this the two graphs will coincide and we are done. We do this by induction 
on t. This is clearly true ioT t — and assume that Ff and Fj differ only in e and t < t^. Note that the 
induction hypothesis implies that the sets Yq,Yi, . . . , Z are the same in F^ , FJ . This is because deleting edge 
e does not affect z's status, because e is after the Z-witness of x in the order a. It does not affect y's status, 
because the degree of y will be at least 3 after the deletion. 

Because the sets Yo,Yi, . . . , Z are unchanged, the choice of step is the same in Fj and Fj. The difference 
between the two graphs only affects the degrees of x and y and by construction, this is never enough to 
change a choice of edge. □ 

Remark 4.1. Suppose that e,j = (w, w) and that (i) v € Rq, (ii) w g Aq and (Hi) is tardy. Then replacing 
Ci by {v',w') such that (i) v' G Rq and (ii) w' £ Aq results in the the same output M,W. 

The net effect of this is that if we condition on all edges except for the tardy edges between Rq and Aq then 
the unconditioned tardy Rq : Aq edges are random. This is what we mean by there being residual randomness. 



5 Degree Sequence of 



The degrees of the vertices in G are distributed as truncated Poisson random variables Po{X; > 3), see for 
example [3] . More precisely we can generate the degree sequence by taking random variables Zi, Z2, . ■ . , Zn 
where 

P{Z, = k)= for i^ 1,2,..., n and k> 3, (2) 

klfsW 

where /, (A) = - ECo IT J > 1. 

Then we condition on Zi + Z2 + ■■■ Zn — 2m. The resulting Zi, Z2, . ■ . , Zn can be taken to have the same 
distribution as the degrees of G. This follows from Lemma 4 of [3 . If we choose A so that 

^, 2m A/,(A) 2m 

E(Po(A; > 3)) = — or = — 

n /3(A) n 

then the conditional probability, F{Zi + Z2 + ■ ■ ■ Z^, — 2m) — Vl{l/^Jri) and so we will have to pay a factor of 
0{-/n) for removing the conditioning i.e. to use the simple inequality P(yl | B) < F{A) /P{B). (This factor 
0(n^/^) can be removed but it will not be necessary to do this here). 

The maximum degree A in G is less than logn q.s0 and equation (7) of 3J enables us to claim that that if 
Vk,2 < k < logn is the number of vertices of degree k then q.s. 



n\ e 



for some constant Ki > 0. 



<Ki(^l + y^nA'=e-^/(fc!/3(A))j logn, 2<k< log 



(3) 



sequence of events, £„ occurs quite surely (q.s.) if V(^£n) = o{n for any C > 0. 
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In particular, this implies that if the degrees of the vertices in G are di^d2, . - ■ ^ dn then q.s. 

71 

Y,dr{d,-l) = 0{n). (4) 

2 = 1 

Given the degree sequence we make our computations in the configuration model, see BoUobas [5|. Let 
d = (di, d2, ■ . ■ ,dn) be a sequence of non- negative integers with 2m = cn. Let W = [2cn] be our set of points 
and let Wi = [di + ■ ■ ■ + di^i + l,di + ■ ■ ■ + di], i £ [n], partition W. The function <j) : W ~^ [n] is defined 
hy w & Given a pairing F (i.e. a partition of W into m = cn pairs) we obtain a (multi-)graph Gp 

with vertex set [n] and an edge {4>{u), 4>{v)) for each {u, w} S F. Choosing a pairing F uniformly at random 
from among all possible pairings of the points of W produces a random (multi-)graph Gp- 

This model is valuable because of the following easily proven fact: Suppose G G Gn,A, the set of (simple) 
graphs with vertex set [n] and degree sequence d. Then 

¥{Gf G I Gf is simple) = ^ 



\Qn,d.\ 

It follows that if G is chosen randomly from Qn.d, then for any graph property V 

^ - P(Gf is simple)- 

Furthermore, applying Lemmas 4.4 and 4.5 of McKay [16] we see that if the degree sequence of G satisfies 
then P(G_F is simple) — ri(l). In which case the configuration model can substitute for Qn^d (and hence 
^r^m) ill dealing with events of probability o(n~^/^). 

Lemma 5.1. W.h.p. 

(a) G^-j^ contains no set S [n],3 < s — \S\ < sq = ^ log^ n such that S contains at least s + 1 edges. 

(h) Let Wi denote the set of vertices v that are within distance £q — 2 log log n of a cycle of length at most 
2io m Gi^,^. Then w.h.p. \Wi \ < n^/^ hg^'^" n. 

(c) W.h.p. there does not exist a connected subset of Kc\ogn < s < n^^^ vertices that contain s/10 vertices 
of degree at most 30. Here Kc is some sufficiently large constant. 

Proof (a) The expected number of sets S containing |5| + 1 edges can be bounded by 

s=3 |S|=s £'>3s diH Vds=D i=l ■' J i \ ' / \ / 

di,...,d^>3 

Explanation: For ([6]) we choose a set of size s with vertices of degree di,d2, . ■ . ,ds > 3 and di + - ■ ■ + ds = D. 
The term 01=1 f3{\)d \ (™odulo 0{n^^^)) accounts for the probability of these degrees. We then choose s + 1 
configuration points and approximate the probablity that they are all paired with other points associated 

^ith s by {^y^\ We use Ed,+...+d.=D UU 37T = 7^ ^ get ©■ 

Continuing we observe that (D/cs)^'*+^ < (l + |)^ for D > 3s. Thisis clearly true for D < cs and follows 



by induction on D > cs. Therefore, 

E i^Tf^ i (<=•) 



j~)2s+2\D„D 

^ ^ S - ' ^2s+2g(A+3)s 



D>3s 
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Plugging this into ([7]) we get a bound of 



<0 



„i/2jE( s J yn{s + l)MX) 



1,2 \ 



/ 2 \ ^0 



= 0(1). 



(b) 



(We remind the reader that it is possible to remove the 0{n-^^^) factor here. This would be worth doing if 
we could reduce e to 0(loglogn/ logn). This should become apparant in the proof of Lemma 15^ equation 

(EH). 

Part (b) follows from the Markov inequality. 

(c) For a fixed s, the probability such a set exists can be bounded by 

|S|=s ^ ' ^ D>3s diH hds=-D i=l ^ J ■> ^ ^ ^ 

3<di,ie[s] 
di<30,iG[s/10] 

Explanation: We choose a set S and we let the degrees in S he di,d2, ... ,ds where D is the total degree. 
Since the induced subgraph is connected, it must contain a spanning tree. We weaken this to it must contain 
s — 1 edges. (J^i) enumerates the lower numbered point of the edges and then D*~^ enumerates the other 

possible endpoints and then (^ ^^-2^ ) ~ (c7i°°-i bounds the probability the selected pairs exist. 
We bound this by 

s-l / 30 . . . \ ''/lo 



«<-'(:)U) i:. (»)(£) ^'<^'-''-'(S^) 



9s/10 



6—1 -, / 30 



s/10 



for any positive ^. 

Now if A is large then /3(A) > Also, /3(A(1 + C)) < e^(i+«). Furthermore, 

^ A'(l + e)' < AV« ^ /Ae^+« ^ 
^ i! - 30! - I 30 

1=3 ^ 

We will take ^ to be small but fixed. Then the bound becomes 

/ „2 \ « 9s/10„9Aes/10 / / \pl+5\ ^ n2-s-l 



0>3, <1 + « 



D ■ 
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We observe that ifuo = (^i^''^)i> then ud+i/ud < D >9. 



So, 



D>3s ^ ^' D>3s ^ i=3s D>3s ^ ^ ^ 

For the last inequahty we use the fact that s is large and then <^ (1 + ^)^. 
Continuing, we get a bound of 

30\ «/10 



< 



_0(,3/2. /9(40e)ViV/Ae^\ 

= 0(1) 

if we take ^ = 1/10 and c and hence A sufficiently large. 



5\ * 



□ 



6 Finding a Hamilton cycle 

Wc assume that we have a path P with cndpoints a, b and we do rotations with a as the fixed endpoint to 
try to find an extension. In the next section we show that if no extensions are found, then w.h.p. we create 
sufficient endpoints other than b on paths of length equal to P. Throughout this description, we will assume 
that no extension is found i.e. all neighbors of endpoints turn out to be vertices of P. We associate the 
search with something similar to an alternating tree of matching theory. 



6.1 Tree Growth 

In this section we describe our search for a longer path than P using extend-ROTATE in terms of growing a 
tree structure, where each vertex determines a new long path. We expose what happens w.h.p. if we fail to 
find an extension. Let Aq = {b} and let Bo be the set of neighbors of b on P, excluding b's path neighbor. 
We now define the sets Ai,Bi,i = 1, . . . , and d = Uj<i(^i U i3j). Here every vertex v in Ai will be the 
endpoint of a path of the same length as P. It will be obtained from P by exactly i rotations with a as the 
fixed endpoint. Fix i > and let Ai = {vi,V2, ■ ■ ■ , Vk}- We build Sj+i by examining vi,V2, ■ ■ ■ ,Vk in 

this order. Initially Aj+i = -Bj+i = and we will add vertices as we process the vertices of A^. Fix v = vj. 
We have a path P„ with endpoints a, v. We consider two cases: 

Case 1: \Ci\ < Lq = ^ log^ n. 

Let Ny = {ui,U2, ■ ■ ■ ,Ud} be the neighbors of v, excluding its neighbor on Py. We also exclude from Ny 
those neighbors already in (as defined so far). Let wi be the neighbor of ui on Py that lies between ui 

and V for I = 1,2, . . . ,d. Let = {wi, W2, .... Wd}- We exclude from those vertices already in ^,+1 (as 
defined so far). We add Ny to B^+i and iV^ to Ai+i and we add edges {v,Uj) and {uj,Wj) to T. The edge 
{uj,Wj) will be called a lost edge. Furthermore, we define P^,^ = Py + {v,Uj) — {uj,Wj) and observe that 
has endpoints b,Wj. 

Case 2: |Cj| > Lq. 

Now let Ny = {ui, M2, . . . . U(i} be its neighbors as above. We now exclude from Ny those neighbors already 
in Cj+i (as defined so far) as well as those Uj for which wj G Cj+i. We define Ny and update ^j+i, Sj+i, T 
with this restricted A^^. 
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We define the subgraph T ~ T{P,b,k) as follows: It has vertex set Ck plus the edges of the form {v,Uj) 
and {uj,Wj) used above. T suggests a tree. It is usually a tree, but in rare cases it may be unicyclic. This 
follows from Lemma [5. II When this happens, some v G Ai (Case 1) has a neighbor in Bj,j < i. 

We see from this that w.h.p. T defined prior to the lemma has at most one cycle. By construction, cycles 
of T are contained in the first io levels. If there are two cycles inside the first io levels then there is a set S 
(consisting of the two cycles plus a path joining them) with at most Aio vertices and at least \S\ + 1 edges. 

We argue next that w.h.p. T can be assumed to grow to a certain size and we can control its rate of growth. 

Lemma 6.1. Let (3 be some small fixed positive constant. If c is sufficiently large, then for all paths P and 
endpoints b such that extension does not occur, w.h.p. 

(a) There exists k such that \Ck\ > Lq — logj,ri. 

(b) If Lq < \Ck\ < n-^ then \Ak+i\ G [2(1 — /3)c|Cfe|, 2(1 + /3)c|Cfc|], even if only punctual edges are used once 
l^fcl reaches size at least . 

(c) There exists kn = O(log^n) such that \ Ak„ \ E [(2c(l + l3))-^n^/'^+''"' ,2c{l + l3)n^/'^+^^]. 

(d) Let ki = kf) — £q where £q — 2 log log n and let x G A^-^ . Let S be the set of descendants of x in Ak^ and 
let s = \S\. Let So ~ {y S : d{y) > 30} and let sq — \Sq\. Then, where Wi is as in Lemma \5.1\ 

(i) x iWi and s> (2c(l - l3))''»-''^ /A implies that sq > 99s/100. 
(a) s < (2c(l + logn. 

Proof (a) Lemma 2.1 of [12] proves the following: Suppose that S is the set of endpoints that can be 
produced by considering all possible sequences of rotations starting with some fixed path P and keeping one 
endpoint fixed. Let T be the set of external neighbors of S. Here S" n T = 0. Then |r| < 2|S'| and U T 
conmtains strictly more than jS* U Tj edges. The assumption here is that the graph involved has minimum 
degree at least three. It follows from Lemma [STT] that \S\ > ji^logj,n. As a final check, if \Ck\ never reached 
Lq then it would have explored all possible sets of endpoints i.e. the breadth first search is no restriction. 

(b) If the condition in (b) fails then the following structure appears: Let (5 = 1 if T is not a tree and 
otherwise. Let EVEN{T) = lJi=o ^« ^^'^ ODD{T) = lJi=o where k is the number of iterations involved 
in the construction of T. Then with \EVEN{T)\ = I and \N{EVEN{T))\ = r we have (i) 2{l - 1) + S edges 
of T connecting EVEN(T) to Odd(T) (ii) r-l + 1 edges connecting EVEN(T) to N{EVEN(T)) \ Odd(T) 
and (ui) none of the l{n - r - I) edges between EVEN(T) and V \ N{EVEN(T)) are present. 

Assume first that T is actually a tree and that ^ < ti*^ so that the edges of T need not be punctual. 

Given the vertices of T and N{EVEN{T)), the probability of the existence of a T with Lq < I < n-^ and 
^ (1 ~ I3)cl can be bounded by 

^ V ' di>3,te[r+l-l] •' ^ ^'1=1+1 ■'^^ ^ i=2l •'^^ 'I 

Explanation: The probability that an edge exists between vertices u and v of degrees d„ and dy, given 
the existence of other edges in T, is at most 2m-2(;+r)+3 '^^ere d'^^ = d^ less the number of edges already 
assumed to be incident with u. Hence, given the degree sequence, the probability that T exists is at most 

, ^ l+r-1 I 2/-1 r+l~l 

\{d^ n d.{d.-^) n 

^ i=l i=l+l i=2l 
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(We dropped the +3 in 2m - 2{l + r) + 3). 

Here the first product corresponds to EVEN{T), the second product corresponds to ODD{T) and the final 
product corresponds to neighbours of T (not in T). 

We will implicitly use the fact that if c is sufficiently large, then so is A. 

We now simplify the expression ([8]) obtained for the probability to 



1 



2m-2il + r) 



^2r+2/-3 
r+TT 



E 



E n 



2'-l r+l-1 ^d,-l 



n 



J2\^i di=r+l-l \d,>3. ie[r+l-l] i=l+l 
l+r-1 

X 



(d,-2)! 11 (d.-l)! 



1 



/3(A) 

< o{V^ 

< o{V^ 
= o(V^ 



2m - 2(/ + r) 

E 



21-1 



n E 



di=r+/-l \\i=i + ld,>3 
'di>3,iell] 



using 



/3 

< o(\/^: 

- 0(^^ 



(A) 



A) 



2m 


-2(/H 


hr) 




1 




2 m 


-2(/H 


hr) 




1 




2m 


-2(/H 


hr) 


) 


2c 






1 




2m 


-2(/H 


hr) 




1 




2m 


-2(/H 


hr) 



_ _ (d,-2)! 

'+■''-1 ^2r+2;-3 



n E (rf. _ 



/3(A)'-+'-i 

i+r-1 ^2r+2;-3 /g^si 



/i(A)'-^ /2(A)'- 



(9) 



/3(A) 



r+l- 



T (t) 



'+''"^ 'ery (2cA)'- /2cA/i(A)^ 



^ ^ A/2(A) V f2ixy 



2e(l - /3)c(2cA)'- /4(1 - ;9)ec2A/i(A) 



A/2 (A) 



/2(A)2 



2e(l - /3)c(2cA)'- /4(1 - ;32)g^2;s^ 



A/2 (A) 



i-i 



(-) 

\2cn J 



l+r-l 



„3(/+r)V2cn 



(2cA)' 



4ec2A 



/-I 



/2(A) 



usmg m = cn 



usmg 



(10) 

using r/l < 2(1 - /3)c (11) 
/i(A) 



/2(A) 



l+r-l 



go(0 A'- 



2ecA 



i-i 



(12) 



since r = 0(0- 

We now count the number of such configurations. We begin by choosing EVEN(T) and the root vertex of 
the tree in at most n(j"j^) ways. We make the following observation about T. The contraction of the lost 
edges of the tree yields a unique tree on I vertices. We note, by Cayley's formula, that the number of trees 
that could be formed using I vertices is Reversing this contraction, we now choose the sequence of I 

vertices, Odd(T), that connect up vertices in EVEN(T) in {n — l){n — I — l)...{n — 2Z + 1) = (n — l)i ways. 
We pick the remaining r — I vertices from the remaining n — 21 vertices in ("~^') ways. These r — I vertices 
can connect to any of EVEN(T) in ways. Hence, the total number of choices for T is at most 



n~2l 
r-l 



r-l 



(13) 
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Combining the bounds for probability and choices of T, we get an upper bound of 

The expression (^)^ is maximized at x = XL Our assumptions imply that r < 2(1 — /?)d < XI. Hence, we 
have the bound 

2(l-/3)ci /,n.\2X ' 



using 



120c3e(^-^)° 

/2(A) > ^^2{l~l3)c 

for c sufficently large. 

We sum 0{n^^^) ■ e~^'^^ over all r and / with Lq < I < n^-^ and ? < r < (1 — /?)cZ and we get the probabihty 
to be at most 

for c sufficiently large. 

We now consider the probability of the existence of a T having Lq < I < nP-^ and r > 2(1 + /3)c/. Note that 
we can assume r < ZA < Zlogn here. 

The bound (fTO]) remains valid. Replacing r by r + 1 multiplies this by a factor 0(c7i^^e'/'') and so for this 
bound we can just assume that r = 2(1 + /3)cL This changes the 1 — /3 in (ITT|) to 1 + /3 and we replace p2|) 

o,v.)(i)'-'.».(^)'- 

We re-use and replace (ITU) by 

Our assumptions imply that r = 2(1 + /3)cZ > XL Hence, we have the bound 

/ . ( /3o(i+^y 

^ l2(i+/?)d; /2(A) j 

3/2, (f e 120(1 + /3)VV . , 



using 



120(1 + /3)2c3e(2+4«^ 

72 (Aj > 



(2(l+/?))2(l+/3) 

for c sufficiently large. 
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We sum 0{n^^^) ■ e ^'^^ over all r and I with Lq < I < n^'^ and r > 2(1 + (3)cl and we get the probabihty to 
be at most 

0(n^/2)g-/3cLo = (18) 

for c sufficiently large. 

We next consider the case where I > and we can only use O(n^) late edges. We will use ([13]), which is 
still an upper bound and only modify ([8|). Let 

bid, d', l-a)=('^) ((l-'^M.'l'^H.-.' ^ Vi _ afa^-'^' (l + O " 
\d'J [m)d \d'J V 

for d < A < logn. 
We replace ^ by 

l+r-l 



2m-2{l + r) 



X 



ELi <=r+i-l 



5Z $z n M. _ 1 1 n X! f^. _ 2V 1 1 n X! _ 



1 A2-+2'-3(l_a)2''-l 



2m-2(; + r); /3(A) 



r+l-l 



d'.>\ yK>0 y Y i=l di>3 ^ J \ '=2' '''-^ 

EUl d'i=r+l-l 

l+r-1 ^2r+2i-3(i_^)2r 



^"'^' U.n-2(i + .) j ^ " /.(A)"..-." - g^^°'/.(A)'-A(A)-^ (20) 

Explanation for (|T9| : di is the degree of vertex i and for i e [I], d[ is the "early" degree. The factor 
h{di,d[, 1 — a) is the probability that i has d- neighbors. 

Observe now that the expression in (l20l) is precisely 

times the expression in It follows that the probability bound ([T5|) can be replaced by 
We sum this over r to get the required conclusion. 

The case r > 2(1 + /3)cl for I < , using only punctual edges follows a fortiori from the previous analysis. 

We finally consider the case where T is not a tree. When this happens, it will be because of a unique 
(Lemma 15. ip edge introduced in Case 1. We can be handle this by multiplying our final estimates by 
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0{L'^n ^ log^ n). The factor 0{Lf^) accounts for choosing a pair of vertices in T in Case 1 and 0{n ^ log^ n) 
bounds the probabihty of the existence of this edge, given previous edges. 

Part (c) follows from (b). 

(d) If we consider the growth of the sub-tree emanating from x then we can argue that it grows as fast as 
described in (a) and (b). We just have to deal with the edges pointing into the part of T that has already 
been constructed. We can argue as for (|19p with a — o(l), since the chances of choosing an endpoint in T 
is 0(1) at each point. 

If a; ^ Wi then the descendants Di of x at levels fcg + i grow at a rate of at least two (i.e. jDi+il > 2|Di|) 
for O(loglogn) steps until \Di\ ^ logrt and after this will grow at a rate of al least 2c(l — (3). In which case 
the leaves of T^, the sub-tree of T rooted at a;, will constitute a fraction 1 — 0(l/c) of the vertices of T^- 
The result now follows from Lemma IS.lf c). 

If a; e Wi then \Di\ grows at a rate of at most 2c(l 4- /?) once it has reached size logn. □ 

Remark 6.1. It follows from this lemma that only 0(n^/^+'^('^' ) tardy edges are needed to build all of the 
instances of A^g needed by extend-rotate. // one looks at Section 4-3. 1 of fS^ one sees, in conjunction 
with equation (1) of that paper that the total running time of Step 3b of this paper is 0{n'^^^~^°^^'^) and so 
we can use this as a bound on the number of punctual edges examined by Step 3b. We can drastically reduce 
this in the same way we did for building the trees in EXTEND-ROTATE, but since we are only claiming our 
result for c sufficiently large and e <^ .005, this is not necessary, since there will w.h.p. be Q.(n^~'^^) tardy 
Rq : Aq edges, see Lemma \67E\ below. In other words, almost all of the tardy Rq : Aq edges are not using for 
tree building. 

The above lemma shows that Ak can be relied on to get large. Unfortunately, we need to do some more 
analysis because we do not have full independence, having run 2greedy. Normally, one would only have 
to show that END{a) is large for all relevant vertices a and this would be enough to show the existence 
w.h.p. of an edge joining a to 6 S END{a) for some a, b. We will have to restrict our attention to the case 
where a d Rq and b £ Aq, see Remark 14.11 So first of all we will show that w.h.p. there are many a € Rq, 
see Lemma 16.81 For this we need to show that every path we come across contains many consecutive triples 
u,v,w e Rq. In which case, an inserted edge (x, v) produces a path with an endpoint in Rq. We also need 
to show that w.h.p. there are many b £ Aq, see Lemma 16.71 We will also need to show that there are many 
edges that can be (a, b), see Lemma [6.51 

For the Lemma [6.31 below we need some results from [TU]. Let u = u{t) denote {y{t), z{t), ^{t)) and let 
u = u(t) denote {y{t), z(t), fi{t)) where y{t) etc. denotes the value oi y = \Y\,z = \Z\.jj, = \E{T{t)\ at time t 
and y{t) etc. denotes the deterministic value for the solution to the associated set of differential equations, 
summarised in equation (152) of that paper: 

^=A + B-C-1- ^=2C-2A-2B; ^ = -l-D. (21) 
dt dt dt 

where 

8AV2(A)/3(A)' 4A2/2(A)2' 2A/3(A)' 2A/2(A)' 

and fj{x) = e^-J2lZo fr- 

Lemma 7.1 of [10 proves that u{t) and u.{t) are close w.h.p.: 
Lemma 6.2. 

||u(t) -u(i)||i <n«/9, /or l<t<min{To, To} w.h.p.. 
Here Tq is a stopping time and Tq is a deterministic time such that w.h.p. Step 3 begins before min |To, To|. 
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Note that e < 1/9. Let 

i„ = n^l^-^ and p = n^/^ 

Equation (163) of [10] states that w.h.p. 

le^uW) - Ad = O [p-^ log' n + ^"Wlli ^ e = a, 6, c, 2. (23) 

Here 

0, 06 = i 6ic = i + ^ and 02 = 1 - 6'a - 6*6 - fi'c 
and is the proportion of steps in + p\ that are Step 1<^ or Step 2, if ^ = 2. 

Now if i = o(n) p, ~ f2(n)and A = i^(l) then we have from l[22|) that A,B = 0{z/n) and that 0b = 
0{z/n),6c = 0(z/n), 02 = 2 — o(l). Then from (pij) we see that z grows at the rate 2 — o(l) per time step, 
so long as < = 0(71) and hence z = o{n). 

It is shown in lOj that if c > 10 then w.h.p. A = up until the (random) time when Step 3 begins. See 
equation (190) of that paper. Furthermore, it follows from Lemma 16.21 that w.h.p. 

XI If < = jn^^'^ for some constant 7 then w.h.p. z(t) ^ 2t. 

X2 If t = jn^^'^ for some constant 7 then w.h.p. there will be 0(n^~^^) instances of Step 1 in [0,t]. 
X3 A = up until the start of Step 3. 

Lemma 6.3. W.h.p., all the paths in Steps 1 and 2 o/extend-rotate contain at least no — fi(n^~*'^/ logn) 
pairs of consecutive edges {u,v), {v,w) such that u,v,w G i?o- 

Proof First consider the steps in the range [0, iop/4]. It follows from XI that at the end of this period, 
there will w.h.p. be at least iop/S vertices in Z. Consider the edge {v, w) of Step 2 at some time in [0, iop/A]. 
The probability that w G Z is certainly fl(n^^) and the probability it has a punctual Z-witness is 1 — a — o(l). 
This holds regardless of the previous history, once we condition on an event that happens w.h.p. 

The probability that w G Z is certainly 0{n~^A) = 0{n~'^ logn). This implies that the number of times we 
create a component of M containing more than two vertices is 0(n^~^'^ logn). Thus almost all components 
of M at the end of the period [O.zop/4] consist of isolated edges. Let us assume then that there are at 
least Ain^~^ such edges where in the following Ai, A2, . . . , are positve constants. Let Si denote this set of 
components. 

Now consider the steps in the range [jop/4, iop/2] and consider the edge {v, w) of Step 2. We have w G ^^(5*1) 
with probability at least A2n~^. This is because w.h.p. the total degree of V{Si) will be n{n^~^) and 
the total degree of G is at most 2cn. The vertex w is early by construction. Also the Z-witness of w 
will be punctual with probability at least 1 — a — 0(n~^). We next observe that with probability at least 

(1 — (A))" — I — 0(1), this component will not be absorbed into a larger component in [zop/4, iop/2\. 
Thus, in expectation, at time iop/2 there is a set of A^n^~^'^ components of Al consisting of a path of length 
two with its middle vertex in i?o- A simple second moment calculation will show concentration around the 
mean, for 15*21. 

We can repeat this argument for the periods [iop/2, 3iop/4],[3jop/4, top] to argue that by time n^~^ , M will 
contain a set 5*3 of at least A^n^''^^ components consisting of paths of length four in which the internal 
vertices are all in R^. 

We can argue that w.h.p. at least half of the components in 6*3 will have both end vertices of degree at most 
3c. Denote these by 6*4. Indeed the number of edges incident with vertices of degree more than 3c is relatively 
small. Indeed, the expected number of such edges is asymptotically equal to X]fc>3c fc!/3(A) — = (6/3)^^^. 
The number of such edges is concentrated around its mean. If we assume degrees are independent and less 
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than logn then we can use Hoeffding's Theorem and then correct by a factor 0(n^/^) to condition on the 
total degree. Given this, we see that w.h.p. at least 2(1 — ec)^/3 of the components of ^3 will be created in 
two executions of Step 2 with the degree v less than 3c. 

(\ 2cn 
1 — — i^(l) a component C £ S3 will survive as a 

component of M until the end of Step 2. Because 1531 is 0{n^^^), this is true regardless of which other 
compponents in ^3 survive. The f2(n) in the denominator comes from the fact that w.h.p. Step 2 ends 
with \Z\ = fl{n). Let iS'4 denote this set of components and note that w.h.p. there will be at least A^n^^'^'^ 
components in ^4. 

Step 3 of 2GREEDY adds a matching M** that is disjoint from the edges in the contraction of S3 to a 
matching. This matching is independent of ^3. This implies that w.h.p. any cycle (or possibly path) of 
the union of M* and M** of length i > n^^, contains at least A^£n~'^'^ members of ^4. Here we are using 
concentration of the hypergeometric distribution i.e. sampling without replacement. 

In EXTEND-ROTATE we start with a path of length i = ^l{n/ logn) and w.h.p. every path is generated by 
deleting at most 0(log^ n) edges. This completes the proof of the lemma. □ 



6.2 Batches 

Let T{t) denote the graph F after t steps of 2greedy. Suppose that ti < t2 < n^~'^ and that 2greedy 
applies Step 2 at times ti,t2 and Step 1 at times ti < t < t2- We consider the set of edges and vertices 
removed from time <i to time t2, i.e. the graph T{ti) \ T{t2) and call it a batch. Note that batches are 
connected subgraphs since each edge/ vertex removed is incident to some edge that is also removed. 

We also claim that each batch w.h.p. is constructed within 0(log^ n) steps and contains 0(log'^ n) vertices. 
This follows from ^Oj as we now explain. Let ^ = + 2y2 + zi. Equations (67), (68), (69) of [lOj show that 

IE[C'-CI |v|]--(i-Q)-o(i) 

where 

' V /3(A) /2(A) ^ ^2 J^(^)2 • 

Lemma 6.2 of [TU^ shows that 1 — Q = —1^(1) if A = and X3 is our justification for assuming this. 

Thus the expected change in ( is — i^(l) when ^ > 0. We carry out Step 2 iff C = 0. Now ( can change by at 
most 0(A) = O(logn) and has a negative drift whenever it is positive. This implies that it must return to 
zero within O(log^n) steps. Another A < logn factor will allow at most logn edges to be removed in one 
step. By making the hidden constant sufficently large, we can replace w.h.p. by with probability l~0{n^^'^). 

Lemma 6.4. 

(a) W.h.p. there are at most n^~^^ vertices v G G that are within distance £0 = 2 log logn of 6 distinct 
hatches. 

(h) W.h.p. no vertex has degree more than 4 in a single batch. 
Proof 

(a) We bound the probability of being within distance £0 of s batches by 

Ps^Y.[ s ) Y[Hdist{v,B,) < £0 I dist{v,Bj) < 4, 1 < j < i)- 

Explanation: Here (" ^ ) is the number of choices for the start times of the batches 
-Bi, i?2, • • ■ , Bs- 
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We claim that for each i,v, 

¥{dist{v, B,) < £q I dist{v, Bj) < £o, 1 < j < i) = O i — ^— | . (24) 

This gives 

P. < exp{-(i^ - 2 - o(l))(loglogn)2s} . 

and assuming > 13, this imphes that the expected number of vertices within distance £o of 6 batches is 
less than n^~^^. The result now follows from the Markov inequality. 

Proof of l[24|) : Suppose that Bi is constructed at time ti. It is a subgraph of r(ti) and depends only on 
this graph. We argue that 

Ci loff^^^" n\ 
n ■ ^'^^^ 

Explanation: The 0{n-^°) term is the probability the batch B, is large. The term O fU2£l!2Jl\ in ^ 



arises as follows. We can assume that |7V£(t!)| < A " < log ° n, where Ni{v) is the set of vertices within 
distance £ of v. Suppose as in [3] we expose the graph F at the same time that we run 2greedy. For us it 
is convenient to work within the configuration model of BoUobas [5] . Assume that we have exposed Ng (w) . 
At the start of the construction of a batch we choose a random edge of the current graph. The probability 
this edge lies in Ni^{w) is 0(log^° n/n). In the middle of the construction of a batch, one endpoint of an 
edge is known and the the other endpoint is chosen randomly from the set of configuration points associated 
with F(t). The probability this new endpoint lies in Ntf,{v) is also 0{\og^° n/n) and there are only 0(log^ n) 
steps in the creation of a batch. 

(b) The probability that vertex v appears fc + 3 times in a fixed batch can be bounded above by 
^O(iog=n)-j ^^-j^fe ^ Q ^log^^y Indeed, if v has degree at least 3 at any time, then the probability its 

degree in the current batch increases in any step is O (■^). □ 

We now argue that there will be a sufficient number of tardy i?o '■ Aq edges. 

Lemma 6.5. W.h.p. there will he tardy Rq : Aq edges. 

Proof We first consider the set Fi of tardy edges e = (w, v) such that (i) u appears at least twice in the 
first n^^'^/10 edges and in at least 30 other punctual edges and (ii) vertex v has degree at least 30 and does 
not appear in the first n^^^/^ edges in a. It is straightforward to show that q.s. we have — Q{mn~^'^). 

Suppose that u satisfies (i). It loses at most 24 edges (Lemma I6.4f a).(b')') before the second edge incident 
with u is chosen and then u will be in Rq. This is because u will be in Z just before this point and will then 
be placed in R. and it will have at least six choices for a punctual .Z-witness. We use the fact that almost 
all of the first n^~'^ steps are Step 2 to see that the edges incident with u occuring in the first n^~^/10 steps 
will indeed be selected before time 

If V satisfies (ii) and loses at most 24 edges because of Step 1 in the first n^~'^ steps then v will be in Aq. 
This is because it will have degree at least six in F„i-e. □ 

We now consider the probability that Ako contains many vertices that lie in Ai = A2 U A3 where 
A2 — : V appears in the first n^^^^^ edges in cr| 

A3 = {w ^ A2 : w loses 24 edges because of Step 1 in the first n"'^"^ steps} . 
In the proof of Lemma [6. 5 1 we used the fact that 

if the degree of v is at least 30 and v ^ Ai then e Aq. (26) 
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Lemma 6.6. W.h.p., every extension-rotation tree T has \Akf, H Ai| < |j4fe„|/30. 
Proof We first estimate \Ako n A2I. We go back to (ITTl) and estimate, for fixed r, 

P(3r : n A2I > Z/100) < 0(n3/2) / \ / -Tf^; \ i \i 



-I J V /2(A) J VV200 

(27) 

Explanation: We have taken the RHS of pT)) and multiplied by a bound on the probability that there are 
at least //200 members of EVEN(T) appearing in the first n^~'^^'^ edges of a. Note the the permutation a 
is independent of T and that w.h.p. we will have \Aka \ > 1/2. This is because for most of the time, the tree 
grow at a rate at least 2c(l — /3). We use (fT7|) and not because we can only assume that r < 2(1 + /3)cl. 

Thus, 



■.\Ak„nA2\>l/100)< 

V /2(A) \ cn J 



Now we are interested here in the case where I — ri^/^+°(^) and so this is easily strong enough so that we 
can apply the union bound over r, I. 

We next estimate \Aka n A3I. We replace (gT]) by 
P(3r:|Afe„nA3|>//100)< 

' \r-l) \ /2(A) ) \l/2m)\2?>l/2m)\2QQm) ' 

Explanation; We have taken the RHS of ([T7|) and multiplied by a bound on the probability that there is 

a set of leaves S of size Z/200 such that at least 24Z/200 times during the first n^~'^ steps the vertex it;, the 

neighbor of the selected w, is in S. This is computed assuming that we have exposed the edges of T. Note 

that we have that in at most Z/200 times do we lose the edge of T incident with w € S, explaining the factor 

( ) 
V23V200/ ■ 

Equation ((28|) is replaced by 



P(3r:|Afc„nA3|>//100)< 
^ /2(A) ^ ' \ 231 J \200cnJ J " 

□ 

The next lemma puts a lower bound on |Ao H Akg \ (see Lemma l6.ip . 
Lemma 6.7. W.h.p. |Aon^fcJ > \Ak„\/2. 

Proof Let ki = ko — 2£o where £q ~ 2 log log n. and consider Ak-^ — {ai, 02, ... , a^}. Note that 

jjl/2+5e 

r > 



2c(l + P) log^" n 



Let Si be the number of descendents of in Akg and let s[ be the number of early descendents of Oi in 

^fco n Ai. 
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Let s'l be the number of descendents of a; in Ak^, that have degree at most 30. We observe from Lemma 
WVi h) that 

r 

\Ak,\=Y.s,>r(2c{\-P)f''-^^>r\oi°^-n. (30) 

Next let I — {i ^[r\: a; ^ Wi and Si > (2c(l — (3))''°^''^ / A] (where Wi is from Lemmas [5.1[l6.ip and observe 
that 

^ < r(2c(l - p)f»~^^ /A + 7ii/2 log^^o+i n(2c(l + < |/3. (31) 

It follows from Lemma [OT d) that 

s'l < Si/5 for i e I. 

It follows from Lemma 16.61 that w.h.p. 

r 

J2s'.<\Ako\/30. 

i=l 

Now, after using (pS)) . we see that 



5 3 30 



|Aon ^.J >^(s,-sn-^s: > ' ^ ^ ^ 

iG/ i=l 



We now consider going one iteration further and building 



□ 



Lemma 6.8. W.h.p. ^feo+i contains at least ri(n^/'^+^'^) vertices of Rq. Furthermore, we can find these Rq 
vertices by examining n}~^^ \ogn tardy Ro : Aq edges. 

Proof Assume from Lemmas l6.1l and r6.7l that Akg contains at least ni = ^^Jj^^p-^ vertices in Aq. Assume 
also from Lemma l673l that all of the paths corresponding to Akg have no — ^{n^^^^^ / logn) consecutive triples 
u,v,w Cz Rq. If the middle vertex v is the neighbour of an endpoint, then it yields a new endpoint of Ak^+i 
in Rq. Then the expected number of rotations leading to an endpoint in Rq is at least 

^ 3c2(l + ^) logn ni-2e X n ^ ' 

for some constant Ci > 0. 

We can claim a q.s. lower bound because almost all of the tardy Rq : Aq edges are unconditioned, see remark 
KT\ □ 



7 Finishing the proof 

We have argued that we only need to do £i = 0(log^ n) extensions w.h.p. The tardy Rq : Aq edges are our 
scarce resource of residual randomness. Remark 16.11 explains that we only need to use an o(l) proportion in 
building trees up to the fegth level. We will only use the result of Lemma for growing the first extension- 
rotation tree of each of the 0(log^ n) path extensions. Lemma [6.81 tells us that we only need to use an o(l) 
fraction of the available Rq : Aq edges for producing many paths that have an Rq endpoint. 

Consider a round of extend-rotate where we are trying to extend path P. We start with a path and then 
we construct a BFS "tree" . After the first tree construction of each round, we construct A^g and create one 
more level A^g+i. From Lemma [^751 we should obtain 17(71^/2+^'^) paths with early endpoints. Now we grow 
trees from each of these paths and try to close them using the set El = {/i, /2, . . . , /m} of unused tardy 
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Rq : Aq edges. We can examine these edges in a order. The probability that the next edge fi fails to close a 
path to a cycle is p — r2(n^/^+^^ x ^i/^+Se x So the probability we fail is at most F{Bin{M,p) < £i). 

Now Mp = n{n^^) ^ £i and so the Chernoff bounds imply that we succeed w.h.p. 

As final thought, although we have proved that we can find a Hamilton cycle quickly, being very selective in 
our choice of edges for certain purposes, the breadth first nature of our searches imply that we can proceed 
in a more natural manner and use all edges available to us. In the worst-case we would have to use the 
designated ones. 

8 Why not £ = o(i2gp^? 

In the proof of Lemma 16.11 we need to choose £o = 2 log log n so that 2^° ^ Lq of that lemma. But then 
in (|24p we want ^ log^° n. With some work we could replace the bound log^° n by 0{cY° which would 
allow us to take e — ^ \° g'°^ " ■ The catch here is that in this case we would need K to grow with c. This is 
not satisfactory and so we content ourselves for now with ([1]). 

9 Final Remarks 

We have shown that a Hamilton cycle can w.h.p. be found in 0(n^+°^^'')) time. It should be possible 
to replace n°^^^ by log"^^^-* n and we have explained the technical difiiculty in Section |51 We think that 
O(nlog^n) should be possible. It should also be possible to apply the ideas here to speed up the known 
algorithms for random regular graphs, or graphs with a fixed degree sequence. 
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